Multidimensional Network
   HOME

TheInfoList



OR:

In
network theory Network theory is the study of graphs as a representation of either symmetric relations or asymmetric relations between discrete objects. In computer science and network science, network theory is a part of graph theory: a network can be defi ...
, multidimensional networks, a special type of ''multilayer network'', are networks with multiple kinds of relations. Increasingly sophisticated attempts to model real-world systems as multidimensional networks have yielded valuable insight in the fields of
social network analysis Social network analysis (SNA) is the process of investigating social structures through the use of networks and graph theory. It characterizes networked structures in terms of ''nodes'' (individual actors, people, or things within the network) a ...
, economics, urban and international
transport Transport (in British English), or transportation (in American English), is the intentional movement of humans, animals, and goods from one location to another. Modes of transport include air, land (rail and road), water, cable, pipeline, an ...
,
ecology Ecology () is the study of the relationships between living organisms, including humans, and their physical environment. Ecology considers organisms at the individual, population, community, ecosystem, and biosphere level. Ecology overlaps wi ...
, psychology, medicine, biology, commerce, climatology, physics,
computational neuroscience Computational neuroscience (also known as theoretical neuroscience or mathematical neuroscience) is a branch of neuroscience which employs mathematical models, computer simulations, theoretical analysis and abstractions of the brain to u ...
, operations management, and finance.


Terminology

The rapid exploration of
complex networks Complex Networks is an American media and entertainment company for youth culture, based in New York City. It was founded as a bi-monthly magazine, ''Complex'', by fashion designer Marc (Ecko) Milecofsky. Complex Networks reports on popular a ...
in recent years has been dogged by a lack of standardized naming conventions, as various groups use overlapping and contradictory terminology to describe specific network configurations (e.g., multiplex, multilayer, multilevel, multidimensional, multirelational, interconnected). Formally, multidimensional networks are edge-labeled
multigraph In mathematics, and more specifically in graph theory, a multigraph is a graph which is permitted to have multiple edges (also called ''parallel edges''), that is, edges that have the same end nodes. Thus two vertices may be connected by more ...
s. The term "fully multidimensional" has also been used to refer to a
multipartite Multipartite is a class of virus that have segmented nucleic acid genomes, with each segment of the genome enclosed in a separate viral particle. Only a few ssDNA viruses have multipartite genomes, but a lot more RNA viruses have multipartite geno ...
edge-labeled multigraph. Multidimensional networks have also recently been reframed as specific instances of multilayer networks. In this case, there are as many layers as there are dimensions, and the links between nodes within each layer are simply all the links for a given dimension.


Definition


Unweighted multilayer networks

In elementary network theory, a network is represented by a graph G = (V,E) in which V is the set of
nodes In general, a node is a localized swelling (a "knot") or a point of intersection (a Vertex (graph theory), vertex). Node may refer to: In mathematics *Vertex (graph theory), a vertex in a mathematical graph *Vertex (geometry), a point where two ...
and E the links between nodes, typically represented as a
tuple In mathematics, a tuple is a finite ordered list (sequence) of elements. An -tuple is a sequence (or ordered list) of elements, where is a non-negative integer. There is only one 0-tuple, referred to as ''the empty tuple''. An -tuple is defi ...
of nodes u,v\in V. While this basic formalization is useful for analyzing many systems, real world networks often have added complexity in the form of multiple types of relations between system elements. An early formalization of this idea came through its application in the field of social network analysis (see, e.g., and papers on relational algebras in social networks) in which multiple forms of social connection between people were represented by multiple types of links. To accommodate the presence of more than one type of link, a multidimensional network is represented by a triple G = (V,E,D), where D is a set of dimensions (or layers), each member of which is a different type of link, and E consists of triples (u,v,d) with u,v\in V and d\in D. Note that as in all
directed graph In mathematics, and more specifically in graph theory, a directed graph (or digraph) is a graph that is made up of a set of vertices connected by directed edges, often called arcs. Definition In formal terms, a directed graph is an ordered pa ...
s, the links (u,v,d) and (v,u,d) are distinct. By convention, the number of links between two nodes in a given dimension is either 0 or 1 in a multidimensional network. However, the total number of links between two nodes across all dimensions is less than or equal to , D, .


Weighted multilayer networks

In the case of a
weighted network A weighted network is a network where the ties among nodes have weights assigned to them. A network is a system whose elements are somehow connected. The elements of a system are represented as nodes (also known as actors or vertices) and the connec ...
, this triplet is expanded to a quadruplet e = (u,v,d,w), where w is the weight on the link between u and v in the dimension d. ]Further, as is often useful in social network analysis, link weights may take on positive or negative values. Such signed networks can better reflect relations like amity and enmity in social networks. Alternatively, link signs may be figured as dimensions themselves, e.g. G = (V,E,D) where D=\ and E = \ This approach has particular value when considering unweighted networks. This conception of dimensionality can be expanded should attributes in multiple dimensions need specification. In this instance, links are ''n''-tuples e = (u,v,d_1 \dots d_). Such an expanded formulation, in which links may exist within multiple dimensions, is uncommon but has been used in the study of multidimensional
time-varying network A temporal network, also known as a time-varying network, is a network whose links are active only at certain points in time. Each link carries information on when it is active, along with other possible characteristics such as a weight. Time-var ...
s.


General formulation in terms of tensors

Whereas unidimensional networks have two-dimensional adjacency matrices of size V\times V, in a multidimensional network with D dimensions, the adjacency matrix becomes a multilayer adjacency tensor, a four-dimensional matrix of size (V\times D)\times (V\times D). By using
index notation In mathematics and computer programming, index notation is used to specify the elements of an array of numbers. The formalism of how indices are used varies according to the subject. In particular, there are different methods for referring to th ...
, adjacency matrices can be indicated by A^_, to encode connections between nodes i and j, whereas multilayer adjacency tensors are indicated by M^_, to encode connections between node i in layer \alpha and node j in layer \beta. As in unidimensional matrices, directed links, signed links, and weights are all easily accommodated by this framework. In the case of multiplex networks, which are special types of multilayer networks where nodes can not be interconnected with other nodes in other layers, a three-dimensional matrix of size (V\times V)\times D with entries A^_ is enough to represent the structure of the system by encoding connections between nodes i and j in layer \alpha. ]


Multidimensional network-specific definitions


Multi-layer neighbors

In a multidimensional network, the neighbors of some node v are all nodes connected to v across dimensions.


Multi-layer path length

A Path (graph theory), path between two nodes in a multidimensional network can be represented by a vector r =(r_1, \dots r_) in which the ith entry in r is the number of links traversed in the ith dimension of G.M. Magnani, A. Monreale, G. Rossetti, F. Giannotti: "On multidimensional network measures", SEBD 2013, Rocella Jonica, Italy As with overlapping degree, the sum of these elements can be taken as a rough measure of a path length between two nodes.


Network of layers

The existence of multiple layers (or dimensions) allows to introduce the new concept of ''network of layers'', peculiar of multilayer networks. In fact, layers might be interconnected in such a way that their structure can be described by a network, as shown in the figure. The network of layers is usually weighted (and might be directed), although, in general, the weights depends on the application of interest. A simple approach is, for each pair of layers, to sum all of the weights in the connections between their nodes to obtain edge weights that can be encoded into a matrix q_. The rank-2 adjacency tensor, representing the underlying network of layers in the space \mathbb^ is given by \Psi^_=\sum\limits_^q_E^_(\alpha\beta) where E^_(\alpha\beta) is the canonical matrix with all components equal to zero except for the entry corresponding to row \alpha and column \beta, that is equal to one. Using the tensorial notation, it is possible to obtain the (weighted) network of layers from the multilayer adjacency tensor as \Psi^_=M^_U_^.


Centrality measures


Degree

In a non-interconnected multidimensional network, where interlayer links are absent, the
degree Degree may refer to: As a unit of measurement * Degree (angle), a unit of angle measurement ** Degree of geographical latitude ** Degree of geographical longitude * Degree symbol (°), a notation used in science, engineering, and mathematics ...
of a node is represented by a vector of length , D, : \mathbf = (k^_i,\dots k^_i). Here , D, is an alternative way to denote the number of layers L in multilayer networks. However, for some computations it may be more useful to simply sum the number of links adjacent to a node across all dimensions. This is the ''overlapping degree'': \sum_^ k^_i. As with unidimensional networks, distinction may similarly be drawn between incoming links and outgoing links. If interlayer links are present, the above definition must be adapted to account for them, and the ''multilayer degree'' is given by k^=M^_U_^u^=\sum_^\sum_^M^_ where the tensors U_^ and u^ have all components equal to 1. The heterogeneity in the number of connections of a node across the different layers can be taken into account through the participation coefficient.


Versatility as multilayer centrality

When extended to interconnected multilayer networks, i.e. those systems where nodes are connected across layers, the concept of centrality is better understood in terms of versatility. Nodes that are not central in each layer might be the most important for the multilayer systems in certain scenarios. For instance, this is the case where two layers encode different networks with only one node in common: it is very likely that such a node will have the highest centrality score because it is responsible for the information flow across layers.


Eigenvector versatility

As for unidimensional networks, eigenvector versatility can be defined as the solution of the eigenvalue problem given by M^_\Theta_=\lambda_ \Theta_, where
Einstein summation convention In mathematics, especially the usage of linear algebra in Mathematical physics, Einstein notation (also known as the Einstein summation convention or Einstein summation notation) is a notational convention that implies summation over a set of i ...
is used for sake of simplicity. Here, \Theta_ = \lambda_^M^_\Theta_ gives the multilayer generalization of Bonacich's eigenvector centrality per node per layer. The overall eigenvector versatility is simply obtained by summing up the scores across layers as \theta_=\Theta_u^.


Katz versatility

As for its unidimensional counterpart, the Katz versatility is obtained as the solution \Phi_= \delta-aM)^_U_ of the tensorial equation \Phi_=aM^_\Phi_+bu_, where \delta^_=\delta^_\delta^_, a is a constant smaller than the largest eigenvalue and b is another constant generally equal to 1. The overall Katz versatility is simply obtained by summing up the scores across layers as \phi_=\Phi_u^.


HITS versatility

For unidimensional networks, the
HITS algorithm Hyperlink-Induced Topic Search (HITS; also known as hubs and authorities) is a link analysis algorithm that rates Web pages, developed by Jon Kleinberg. The idea behind Hubs and Authorities stemmed from a particular insight into the creation of web ...
has been originally introduced by
Jon Kleinberg Jon Michael Kleinberg (born 1971) is an American computer scientist and the Tisch University Professor of Computer Science and Information Science at Cornell University known for his work in algorithms and networks. He is a recipient of the Nevanl ...
to rate Web Pages. The basic assumption of the algorithm is that relevant pages, named authorities, are pointed by special Web pages, named hubs. This mechanism can be mathematically described by two coupled equations which reduce to two eigenvalue problems. When the network is undirected, Authority and Hub centrality are equivalent to eigenvector centrality. These properties are preserved by the natural extension of the equations proposed by Kleinberg to the case of interconnected multilayer networks, given by (M M^)^_ \Gamma_ = \lambda_ \Gamma_ and (M^ M)^_ \Upsilon_ = \lambda_ \Upsilon_, where t indicates the transpose operator, \Gamma_ and \Upsilon_ indicate hub and authority centrality, respectively. By contracting the hub and authority tensors, one obtains the overall versatilities as \gamma_ = \Gamma_u^ and \upsilon_ = \Upsilon_u^, respectively.


PageRank versatility

PageRank PageRank (PR) is an algorithm used by Google Search to rank web pages in their search engine results. It is named after both the term "web page" and co-founder Larry Page. PageRank is a way of measuring the importance of website pages. According ...
, better known as ''Google Search Algorithm'' is another measure of centrality in complex networks, originally introduced to rank Web pages. Its extension to the case of interconnected multilayer networks can be obtained as follows. First, it is worth remarking that
PageRank PageRank (PR) is an algorithm used by Google Search to rank web pages in their search engine results. It is named after both the term "web page" and co-founder Larry Page. PageRank is a way of measuring the importance of website pages. According ...
can be seen as the steady-state solution of a special
Markov process A Markov chain or Markov process is a stochastic model describing a sequence of possible events in which the probability of each event depends only on the state attained in the previous event. Informally, this may be thought of as, "What happe ...
on the top of the network.
Random walk In mathematics, a random walk is a random process that describes a path that consists of a succession of random steps on some mathematical space. An elementary example of a random walk is the random walk on the integer number line \mathbb Z ...
ers explore the network according to a special transition matrix and their dynamics is governed by a random walk
master equation In physics, chemistry and related fields, master equations are used to describe the time evolution of a system that can be modelled as being in a probabilistic combination of states at any given time and the switching between states is determine ...
. It is easy to show that the solution of this equation is equivalent to the leading eigenvector of the transition matrix. Random walks have been defined also in the case of interconnected multilayer networks and edge-colored multigraphs (also known as multiplex networks). For interconnected multilayer networks, the transition tensor governing the dynamics of the random walkers within and across layers is given by R^_ = rT^_ + \fracu^_,, where r is a constant, generally set to 0.85, N is the number of nodes and L is the number of layers or dimensions. Here, R^_ might be named ''Google tensor'' and u^_ is the rank-4 tensor with all components equal to 1. As its unidimensional counterpart, PageRank versatility consists of two contributions: one encoding a classical random walk with rate r and one encoding teleportation across nodes and layers with rate 1-r. If we indicate by \Omega_ the eigentensor of the Google tensor R^_, denoting the steady-state probability to find the walker in node i and layer \alpha, the multilayer PageRank is obtained by summing up over layers the eigentensor: \omega_ = \Omega_u^


Triadic closure and clustering coefficients

Like many other network statistics, the meaning of a
clustering coefficient In graph theory, a clustering coefficient is a measure of the degree to which nodes in a graph tend to cluster together. Evidence suggests that in most real-world networks, and in particular social networks, nodes tend to create tightly knit groups ...
becomes ambiguous in multidimensional networks, due to the fact that triples may be closed in different dimensions than they originated. Several attempts have been made to define local clustering coefficients, but these attempts have highlighted the fact that the concept must be fundamentally different in higher dimensions: some groups have based their work off of non-standard definitions, while others have experimented with different definitions of random walks and 3-cycles in multidimensional networks.


Community discovery

While cross-dimensional structures have been studied previously, they fail to detect more subtle associations found in some networks. Taking a slightly different take on the definition of "community" in the case of multidimensional networks allows for reliable identification of communities without the requirement that nodes be in direct contact with each other. For instance, two people who never communicate directly yet still browse many of the same websites would be viable candidates for this sort of algorithm.


Modularity maximization

A generalization of the well-known modularity maximization method for community discovery has been originally proposed by Mucha et al. This multiresolution method assumes a three-dimensional tensor representation of the network connectivity within layers, as for edge-colored multigraphs, and a three-dimensional tensor representation of the network connectivity across layers. It depends on the resolution parameter \gamma and the weight \omega of interlayer connections. In a more compact notation, making use of the tensorial notation, modularity can be written as Q \propto S^_B^_S^_, where B^_=M^_ - P^_, M^_ is the multilayer adjacency tensor, P^_ is the tensor encoding the null model and the value of components of S^_ is defined to be 1 when a node i in layer \alpha belongs to a particular community, labeled by index a, and 0 when it does not.


Tensor decomposition

Non-negative matrix factorization Non-negative matrix factorization (NMF or NNMF), also non-negative matrix approximation is a group of algorithms in multivariate analysis and linear algebra where a matrix is factorized into (usually) two matrices and , with the property that ...
has been proposed to extract the community-activity structure of temporal networks. The multilayer network is represented by a three-dimensional tensor T^_, like an edge-colored multigraph, where the order of layers encode the arrow of time. Tensor factorization by means of Kruskal decomposition is thus applied to T^_ to assign each node to a community across time.


Statistical inference

Methods based on statistical inference, generalizing existing approaches introduced for unidimensional networks, have been proposed. Stochastic block model is the most used generative model, appropriately generalized to the case of multilayer networks. As for unidimensional networks, principled methods like
minimum description length Minimum Description Length (MDL) is a model selection principle where the shortest description of the data is the best model. MDL methods learn through a data compression perspective and are sometimes described as mathematical applications of Occam ...
can be used for model selection in community detection methods based on information flow.


Structural reducibility

Given the higher complexity of multilayer networks with respect to unidimensional networks, an active field of research is devoted to simplify the structure of such systems by employing some kind of dimensionality reduction. A popular method is based on the calculation of the quantum Jensen-Shannon divergence between all pairs of layers, which is then exploited for its metric properties to build a distance matrix and hierarchically cluster the layers. Layers are successively aggregated according to the resulting hierarchical tree and the aggregation procedure is stopped when the
objective function In mathematical optimization and decision theory, a loss function or cost function (sometimes also called an error function) is a function that maps an event or values of one or more variables onto a real number intuitively representing some "cos ...
, based on the entropy of the network, gets a global maximum. This greedy approach is necessary because the underlying problem would require to verify all possible layer groups of any size, requiring a huge number of possible combinations (which is given by the
Bell number In combinatorial mathematics, the Bell numbers count the possible partitions of a set. These numbers have been studied by mathematicians since the 19th century, and their roots go back to medieval Japan. In an example of Stigler's law of eponymy ...
and scales super-exponentially with the number of units). Nevertheless, for multilayer systems with a small number of layers, it has been shown that the method performs optimally in the majority of cases.


Other multilayer network descriptors


Degree correlations

The question of degree correlations in unidimensional networks is fairly straightforward: do networks of similar degree tend to connect to each other? In multidimensional networks, what this question means becomes less clear. When we refer to a node's degree, are we referring to its degree in one dimension, or collapsed over all? When we seek to probe connectivity between nodes, are we comparing the same nodes across dimensions, or different nodes within dimensions, or a combination? What are the consequences of variations in each of these statistics on other network properties? In one study, assortativity was found to decrease robustness in a duplex network.


Path dominance

Given two multidimensional paths, r and s, we say that r ''dominates'' s if and only if: \forall d\in \langle 1,, D, \rangle , r_l \leq s_l and \exists i such that r_l < s_l.


Shortest path discovery

Among other network statistics, many centrality measures rely on the ability to assess shortest paths from node to node. Extending these analyses to a multidimensional network requires incorporating additional connections between nodes into the algorithms currently used (e.g., Dijkstra's). Current approaches include collapsing multi-link connections between nodes in a preprocessing step before performing variations on a
breadth-first search Breadth-first search (BFS) is an algorithm for searching a tree data structure for a node that satisfies a given property. It starts at the tree root and explores all nodes at the present depth prior to moving on to the nodes at the next de ...
of the network.


Multidimensional distance

One way to assess the distance between two nodes in a multidimensional network is by comparing all the multidimensional paths between them and choosing the subset that we define as shortest via path dominance: let MP(u,v) be the set of all paths between u and v. Then the distance between u and v is a set of paths P\subseteq MP such that \forall p\in P, \nexists p'\in MP such that p' dominates p. The length of the elements in the set of shortest paths between two nodes is therefore defined as the ''multidimensional distance''.


Dimension relevance

In a multidimensional network G = (V,E,D), the relevance of a given dimension (or set of dimensions) D' for one node can be assessed by the ratio: \frac.


Dimension connectivity

In a multidimensional network in which different dimensions of connection have different real-world values, statistics characterizing the distribution of links to the various classes are of interest. Thus it is useful to consider two metrics that assess this: dimension connectivity and edge-exclusive dimension connectivity. The former is simply the ratio of the total number of links in a given dimension to the total number of links in every dimension: \frac. The latter assesses, for a given dimension, the number of pairs of nodes connected only by a link in that dimension: \frac.


Burst detection

Burstiness In statistics, burstiness is the intermittent increases and decreases in activity or frequency of an event.Lambiotte, R. (2013.) "Burstiness and Spreading on Temporal Networks", University of Namur. One of measures of burstiness is the Fano factorâ ...
is a well-known phenomenon in many real-world networks, e.g. email or other human communication networks. Additional dimensions of communication provide a more faithful representation of reality and may highlight these patterns or diminish them. Therefore, it is of critical importance that our methods for detecting bursty behavior in networks accommodate multidimensional networks.


Diffusion processes on multilayer networks

Diffusion process In probability theory and statistics, diffusion processes are a class of continuous-time Markov process with almost surely continuous sample paths. Brownian motion, reflected Brownian motion and Ornstein–Uhlenbeck processes are examples of diff ...
es are widely used in
physics Physics is the natural science that studies matter, its fundamental constituents, its motion and behavior through space and time, and the related entities of energy and force. "Physical science is that department of knowledge which r ...
to explore physical systems, as well as in other disciplines as social sciences, neuroscience, urban and international transportation or finance. Recently, simple and more complex diffusive processes have been generalized to multilayer networks. One result common to many studies is that diffusion in multiplex networks, a special type of multilayer system, exhibits two regimes: 1) the weight of inter-layer links, connecting layers each other, is not high enough and the multiplex system behaves like two (or more) uncoupled networks; 2) the weight of inter-layer links is high enough that layers are coupled each other, raising unexpected physical phenomena. It has been shown that there is an abrupt transition between these two regimes. In fact, all network descriptors depending on some diffusive process, from centrality measures to community detection, are affected by the layer-layer coupling. For instance, in the case of community detection, low coupling (where information from each layer separately is more relevant than the overall structure) favors clusters within layers, whereas high coupling (where information from all layer simultaneously is more relevant than the each layer separately) favors cross-layer clusters.


Random walks

As for unidimensional networks, it is possible to define random walks on the top of multilayer systems. However, given the underlying multilayer structure, random walkers are not limited to move from one node to another within the same layer (''jump''), but are also allowed to move across layers (''switch''). Random walks can be used to explore a multilayer system with the ultimate goal to unravel its mesoscale organization, i.e. to partition it in
communities A community is a Level of analysis, social unit (a group of living things) with commonality such as place (geography), place, Norm (social), norms, religion, values, Convention (norm), customs, or Identity (social science), identity. Communiti ...
, and have been recently used to better understand navigability of multilayer networks and their resilience to random failures, as well as for exploring efficiently this type of topologies. In the case of interconnected multilayer systems, the probability to move from a node i in layer \alpha to node j in layer \beta can be encoded into the rank-4 transition tensor T^_ and the discrete-time walk can be described by the master equation p_(t+1)=\sum_^\sum_^T^_p_(t)=\sum_^\sum_^(T^)^_p_(0) where p_(t) indicates the probability of finding the walker in node i in layer \alpha at time t. There are many different types of walks that can be encoded into the transition tensor T^_, depending on how the walkers are allowed to jump and switch. For instance, the walker might either jump or switch in a single time step without distinguishing between inter- and intra-layer links (''classical random walk''), or it can choose either to stay in the current layer and jump, or to switch layer and then jump to another node in the same time step (''physical random walk''). More complicated rules, corresponding to specific problems to solve, can be found in the literature. In some cases, it is possible to find, analytically, the stationary solution of the master equation.


Classical diffusion

The problem of classical diffusion in complex networks is to understand how a quantity will flow through the system and how much time it will take to reach the stationary state. Classical diffusion in multiplex networks has been recently studied by introducing the concept of supra-adjacency matrix, later recognized as a special
flattening Flattening is a measure of the compression of a circle or sphere along a diameter to form an ellipse or an ellipsoid of revolution (spheroid) respectively. Other terms used are ellipticity, or oblateness. The usual notation for flattening is ...
of the multilayer adjacency tensor. In tensorial notation, the diffusion equation on the top of a general multilayer system can be written, concisely, as \frac=-L^_X_(t) where X_(t) is the amount of diffusing quantity at time t in node i in layer \alpha. The rank-4 tensor governing the equation is the Laplacian tensor, generalizing the combinatorial Laplacian matrix of unidimensional networks. It is worth remarking that in non-tensorial notation, the equation takes a more complicated form. Many of the properties of this diffusion process are completely understood in terms of the second smallest eigenvalue of the Laplacian tensor. It is interesting that diffusion in a multiplex system can be faster than diffusion in each layer separately, or in their aggregation, provided that certain spectral properties are satisfied.


Information and epidemics spreading

Recently, how information (or diseases) spread through a multilayer system has been the subject of intense research.


References

{{reflist, 30em Networks Network theory